A Machine Learning Approach to German Pronoun Resolution
Abstract
This paper presents a novel ensemble learning approach to resolving German pronouns. Boosting, the method in question, combines the moderately accurate hypotheses of several classifiers to form a highly accurate one. Experiments show that this approach is superior to a single decision-tree classifier. Furthermore, we present a standalone system that resolves pronouns in unannotated text by using a fully automatic sequence of preprocessing modules that mimics the manual annotation process. Although the system performs well within a limited textual domain, further research is needed to make it effective for open-domain question answering and text summarisation.
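The boosting idea described in the abstract can be illustrated with a minimal AdaBoost sketch. This is not the paper's system: the abstract does not say which boosting variant or features were used, so the algorithm choice (AdaBoost with one-dimensional decision stumps), the toy data in XS and YS, and all function names below are assumptions made purely for illustration.

```python
import math

def stump_predict(x, threshold, polarity):
    # Decision stump: predict `polarity` left of the threshold, the opposite label otherwise.
    return polarity if x < threshold else -polarity

def best_stump(xs, ys, weights):
    # Exhaustively pick the stump with the lowest weighted training error.
    thresholds = [x - 0.5 for x in sorted(set(xs))] + [max(xs) + 0.5]
    best = None
    for threshold in thresholds:
        for polarity in (1, -1):
            err = sum(w for x, y, w in zip(xs, ys, weights)
                      if stump_predict(x, threshold, polarity) != y)
            if best is None or err < best[0]:
                best = (err, threshold, polarity)
    return best

def adaboost(xs, ys, rounds):
    n = len(xs)
    weights = [1.0 / n] * n  # start from a uniform distribution over examples
    ensemble = []
    for _ in range(rounds):
        err, threshold, polarity = best_stump(xs, ys, weights)
        # Vote weight of this weak hypothesis; assumes 0 < err < 1.
        alpha = 0.5 * math.log((1.0 - err) / err)
        ensemble.append((alpha, threshold, polarity))
        # Raise the weight of misclassified examples, lower the rest, then renormalise.
        weights = [w * math.exp(-alpha * y * stump_predict(x, threshold, polarity))
                   for x, y, w in zip(xs, ys, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble

def ensemble_predict(ensemble, x):
    # Weighted vote of all stumps.
    score = sum(alpha * stump_predict(x, t, p) for alpha, t, p in ensemble)
    return 1 if score >= 0 else -1

def accuracy(ensemble, xs, ys):
    return sum(ensemble_predict(ensemble, x) == y for x, y in zip(xs, ys)) / len(xs)

# Hypothetical toy data: no single threshold separates the two classes.
XS = list(range(10))
YS = [1, 1, 1, -1, -1, -1, 1, 1, 1, 1]

single = adaboost(XS, YS, rounds=1)   # one stump on its own
boosted = adaboost(XS, YS, rounds=3)  # weighted vote of three stumps

print("single stump accuracy:", accuracy(single, XS, YS))
print("boosted accuracy:", accuracy(boosted, XS, YS))
```

On this toy data a single stump gets 7 of the 10 training points right, while the weighted vote of three stumps classifies all 10 correctly. That mirrors the abstract's claim only in miniature, since accuracy here is measured on the training points themselves.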
Related resources
A Machine Learning Approach to Pronoun Resolution in Spoken Dialogue
We apply a decision-tree-based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NP- and non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron’s (2002) manually tuned system.
Supervised Ranking for Pronoun Resolution: Some Recent Improvements
A recently-proposed machine learning approach to reference resolution — the twin-candidate approach — has been shown to be more promising than the traditional single-candidate approach. This paper presents a pronoun interpretation system that extends the twin-candidate framework by (1) equipping it with the ability to identify non-referential pronouns, (2) training different models for handling...
Disambiguation of the Neuter Pronoun and Its Effect on Pronominal Coreference Resolution
Coreference resolution, determining the appropriate discourse referent for an anaphoric expression, is an essential but difficult task in natural language processing. It has been observed that an important source of errors in machine-learning-based approaches to this task is the wrong disambiguation of the third person singular neuter pronoun as either referential or non-referential. In this p...
A Machine Learning Approach to Portuguese Pronoun Resolution
Anaphora resolution is an essential component of most NLP applications, from text understanding to Machine Translation. In this work we discuss a supervised machine learning approach to the problem, focusing on instances of anaphora ubiquitously found in a corpus of Brazilian Portuguese texts, namely, third-person pronominal references. Although still limited to a subset of the more general co-...
Using 'Low-cost' Learning Features for Pronoun Resolution
We investigate a machine learning approach to Portuguese pronoun resolution. We presently focus on so-called ‘low-cost’ learning features readily obtainable from the output of a part-of-speech tagger, and we largely bypass deep syntactic and semantic analysis. Preliminary results show significant improvement in resolution precision and recall, and are comparable to existing rule-based approache...
Publication date: 2004